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GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



June 12, 2004, 13:33:34 ; Search time 67396 Seconds 

(without alignments) 
17817.359 Million cell updates/sec 



Title: 

Perfect score : 
Sequence : 

Scoring table: 



€) 



US-09-942-025 

27705 1 

1 gtggcgccgatggacagaat cagaggatctccatgaatag |27705 [- -y^cl^f^de^ 

0LIG0_NUC J 
Gapop 60.0 , Gapext 60.0 



Searched : 



r 



Word size 



3470272 seqs, 21671516995 residues 



200 



SOL 



ocj- 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 500 summaries 



Database : 



GenEmbl : * 



1 
2 
3 
4 
5 
6 
7 
8 
9 

10 
11 
12 
13 
14 
15 
16 
17 
18 
19 
20 
21 
22 
23 
24 
25 
26 
27 



gb_ba : * 
gb_htg: * 
gb_in-: * 
gb_om : * 
gb_ov : * 
gb_pat : * 
gb_ph : * 
gb_pl : * 
gb_pr : * 
gb_ro : * 
gb_sts : * 
gb_sy : * 
gb_un : * 
gb_vi : * 
em_ba : * 
em_f un : * 
em hum: * 



8/t 



t 



[ic&t ex. ?KS dr>v\4/n 



* 

■k 

•k 



em^in 
em_mu 
em_om 
em or 
em ov 
em_pat : * 
em__ph : * 
em__pl : * 
em_ro : * 
em st s : * 



1 



28 


: em 


un : * 


29 


: em 


vi : * 


30 


: em 


htg hum : * 


31 


: em 


htg inv : * 


32 


: em 


htg other : * 


33 


: em 


htg mus : * 


34 


: em 


htg pin : * 


35 


: em 


htg rod:* 


36 


em 


htg mam : * 


37; 


em 


htg vrt : * 


38: 


em 


sy : * 


39: 


em 


htgo hum : * 


40: 


em 


htgo mus : * 


41 : 


em 


htgo other : * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



o 
o 



Result Query 

No. Score Match Length DB ID Description 



1 



14165 



51.1 33529 6 AR166425 



AR166425 Sequence 



ALIGNMENTS 



RESULT 1 
AR166425 
LOCUS 

DEFINITION 

ACCESSION 

VERSION 

KEYWORDS 

SOURCE 

ORGANISM 

REFERENCE 
AUTHORS 
TITLE 
JOURNAL 

FEATURES 

source 



AR166425 ^ 33529 bp DNA 

Sequence 3 from patent US 6280999. 
AR166425 

AR166425.1 GI:16241741 



linear 



PAT 17-OCT-2001 



Unknown . 

Unknown . 

Unclassified . 

1 (bases 1 to 33529) 

Gustafsson, C. , Betlach, M. C. , Ashley, G., Julien,B. and Ziermann, R 
Sorangium polyketide synthases and encoding DNA therefor 
Patent: US 6280999-A 3 28-AUG-2001; 

Location /Qualifiers 

1. .33529 

/ o r gani sm= " unknown " 
/mol_type="unassigned DNA" 



ORIGIN 



Query Match 51.1%; Score 14165; DB 6; Length 33529; 

Best Local Similarity 99.9%; Pred. No. 0; 

Matches 15255; Conservative 0; Mismatches 5; Indels 7; Gaps 7; 

Qy 12439 GATCCCCAGCAGCGGCTGGTGCTGGAGACGGCGTGGGAGGCATTGGAGCGTGCCGGCGTG 124 98 

I M I I I I I I I M I I II I I I I I I I I I I I i I I I I I I | | | | | | | M I I II I I I I I I I I I I I I I 
Db 1 GATCCCCAGCAGCGGCTGGTGCTGGAGACGGCGTGGGAGGCATTGGAGCGTGCCGGCGTG 60 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



June 12, 2004, 13:31:29 



Search time 5997 Seconds 
(without alignments) 
19625.868 Million cell updates/sec 



Title: 

Perfect, score: 
Sequence : 

Scoring table: 



Searched : 
Word size : 



US-09-942-025-12 
27705 

1 gtggcgccgatggacagaat 



cagaggatctccatgaatag 27705 



OLIGO_NUC 

Gapop 60.0 , Gapext 60.0 
3373863 seqs, 2124099041 residues 
200 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 500 summaries 
Database : 



7 



N_Geneseq_2 9Jan04 : * 
1 : geneseqnl98 0s : * 
geneseqnl990s : * 
geneseqn2000s : * 
geneseqn2001as : * 
geneseqn2001bs : * 
geneseqn2002s : * 
geneseqn2003as : * 
geneseqn2003bs : * 
geneseqn2003cs : * 
geneseqn2004s : * 



2 
3 
4 
5 
6 
7 
8 
9 

10 



Pred. No 



is the number of results predicted by chance to have a G tV \h> 

score greater than or equal to the score of the result being pr^te^ v ^ cj^ 
and is derived by analysis of the total score di .^rihnHr,i>-> cK ^5? 

/J* 



Result 
No. 



and is derived by analysis of the total score distributioju 

SUMMARIES 

Query A P T 

Score Match Length DB ID 



1 


25767 


93. 


0 


27705 


9 


2 


20582 


74 . 


3 


67251 


9 


3 


14165 


51. 


1 


33529 


5 


4 


752 


2. 


7 


20922 


9 


5 


480 


1. 


7 


11358 


9 


6 


368 


1. 


3 


750 


3 


7 


220 


0. 


8 


787 


3 




Description 



ADC26979 

ADC26995^ q 
AAS17 367-IA (#12°^" 

ADC2698L 
ADC26983 

AAA58 917- & UNCUS' 
AAA58916^ * 



Adc26979 
Adc26995 
Aasl7367 
Adc26981 
Adc26983 
Aaa58917 
Aaa58916 



Sorangium 
Sorangium 
DNA seque 
Sorangium 
Sorangium 
DNA encod 
DNA encod 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on : 



June 12, 2004, 14:52:34 



Search time 1125 Seconds 
(without alignments ) 
13666.591 Million cell updates/ sec 



Title: 

Perfect score 
Sequence : 

Scoring table 



Searched: 



Word size : 



US-09-942-025-12 
27705 

1 gtggcgccgatggacagaat 



cagaggatctccatgaatag 27705 



OLIGO_NUC 

Gapop 60.0 , Gapext 60.0 
682709 seqs, 277475446 residues 
200 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 500 summaries 



Database : 



1 
2 
3 
4 
5 
6 



Issued_Patents_NA: * 

/cgn2 6/ptodata/2/ ina/5A COMB. seq:* 
/cgn2 6/ptodata/2/ ina/5B COMB. seq:* 
/cgn2 6/ptodata/2/ ina/6A COMB. seq:* 
/cgn2 6/ptodata/2/ ina/6B COMB. seq:* 
/cgn2_6/ptodata/2/ ina/PCTUS COMB. seq:* 
/cgn2_6/ptodata/2/ina/backfilesl . seq: * 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



Result 
No. 



Score 



Query 

Match Length DB 



ID 



Description 



1 14165 51.1 33529 3 US-0 9-14 4 -085-3 

2 368 1.3 750 3 US-0 9-010-80 9-12 

3 220 0.8 787 3 US-09-010-809-11 



Sequence 3, Appli 
Sequence 12, Appl 
Sequence 11, Appl 



ALIGNMENTS 



RESULT 1 
US-09-144-085-3 

; Sequence 3, Application US/09144085 



GenCore version 5.1.6 
Copyright (c) 1993 - 2004 Compugen Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



June 12, 2004, 14:57:19 



Search time 6879 Seconds 
(without alignments) 
18383.440 Million cell updates/ 



Title : 

Perfect score 
Sequence : 



US-09-942-025-12 
27705 

1 gtggcgccgatggacagaat 



cagaggatctccatgaatag 27705 



Scoring table: OLIGO_NUC 

Gapop 60.0 , Gapext 60.0 



Searched : 



Word size : 



2998549 seqs, 2282253817 residues 



200 



Total number of hits satisfying chosen parameters: 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Listing first 500 summaries 



Database : 



1 
2 
3 
4 
5 
6 
7 
8 
9 
10 
11 
12 
13 
14 
15 
16 
17 
18 
19 



Published_Applications_NA: 



/ cgn2_6/ptodata/l/pubpna/US07_PUBCOMB. seq: * 
/ cgn2_6/ptodata/l/pubpna/PCT_NEW_PUB. seq: * 
/ cgn2_6/ptodata/l/pubpna/US06_NEW_PUB. seq: * 
/ cgn2_6/ptodata/l/pubpna/US06_PUBCOMB. seq: * 
/cgn2_6/ptodata/l/pubpna/US07_NEW_PUB. seq: * 
/ cgn2_6/ptodata/l/pubpna/PCTUS_PUBCOMB. seq: * 
/ cgn2_6/ptodata/l/pubpna/US08_NEW_PUB. seq: * 
/ cgn2_6/ptodata/l/pubpna/US08_PUBCOMB. seq: * 
/cgn2_6/ptodata/l/pubpna/US09A_PUBCOMB. seq: * 
/cgn2_6/ptodata/l/pubpna/US09B_PUBCOMB. seq: 
/ cgn2_6/ptodata/l/pubpna/US09C_PUBCOMB. seq: 
/cgn2_6/ptodata/l/pubpna/US09_NEW_PUB. seq: * 
/cgn2_6/ptodata/l/pubpna/US09_NEW_PUB. seq2 : 
/ cgn2_6/ptodata/l/pubpna/US10A_PUBCOMB. seq: 
/ cgn2_6/ptodata/l/pubpna/US10B_PUBCOMB . seq: 
/ cgn2_6/ptodata/l/pubpna/US10C_PUBCOMB. seq: 
/ cgn2_6/ptodata/l/pubpna/US10_NEW_PUB. seq: * 
/ cgn2_6/ptodata/l/pubpna/US60_NEW_PUB. seq: * 
/ cgn2_6/ptodata/l/pubpna/US60__PUBCOMB. seq: * 



* 
* 



Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 



SUMMARIES 



a 
o 



Result Query 

No. Score Match Length DB ID Description 



1 


27705 


100. 


0 


27705 


13 


US- 


09- 


942- 


025- 


12 


2 


27705 


100. 


0 


67311 


13 


US- 


09- 


942- 


025- 


1 


3 


809 


2 . 


9 


20922 


13 


US- 


09- 


942- 


025- 


14 


4 


531 


1. 


9 


11358 


13 


US- 


09- 


942- 


025- 


16 



Sequence 12, Appl 
Sequence 1, Appli 
Sequence 14, Appl 
Sequence 16, Appl 



ALIGNMENTS 



RESULT 1 

US-09-942-025-12 

; Sequence 12, Application US/09942025 

; Publication No. US2003005454 7A1 

; GENERAL INFORMATION: 

; APPLICANT: Kosan Biosciences, Inc. 

; APPLICANT: Julien, Bryan 

; TITLE OF INVENTION: POLYKETIDE SYNTHASE GENE FROM SORANGIUM 

; TITLE OF INVENTION: CELLULOSUM 

; FILE REFERENCE: 300 62-20020.21 

; CURRENT APPLICATION NUMBER: US/09/942,025 

; CURRENT FILING DATE: 2001-08-28 

; PRIOR APPLICATION NUMBER: US 60/271,245 

PRIOR FILING DATE: 2001-02-15 
; PRIOR APPLICATION NUMBER: US 09/144,085 
; PRIOR FILING DATE: 1998-08-31 
; PRIOR APPLICATION NUMBER: US 09/010,809 
; PRIOR FILING DATE: 1998-01-22 
; NUMBER OF SEQ ID NOS : 27 

; SOFTWARE: FastSEQ for Windows Version 4.0 
; SEQ ID NO 12 

LENGTH: 27705 

TYPE: DNA 

ORGANISM: Sorangium cellulosum 
US-09-942-025-12 



Query Match 100.0%; Score 27705; DB 13; Length 27705; 

Best Local Similarity 100.0%; Pred. No. 0; 

Matches 27705; Conservative 0; Mismatches 0; Indels 0; Gaps 



Qy 


l 


GTGGCGCCGATGGACAGAATGCTTGGGTTTCTGCGCGATGCATTGGCGGATCTGCTGCAG 


60 


Db 


l 


1 1 1 1 11 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 I I I M | | | | | || | | || || || | | | 

GTGGCGCCGATGGACAGAATGCTTGGGTTTCTGCGCGATGCATTGGCGGATCTGCTGCAG 


60 


Qy 


61 


GTTGCGCCGGGAGCCGTTCAGGATGATGTTCGATTCCACGAGCAGGGCCTCGACTCCGCG 


120 


Db 


61 


1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 I 1 1 1 1 1 1 1 1 1 1 1 I I I 1 1 M 1 1 1 1 1 II 1 1 1 1 1 1 1 M 
GTTGCGCCGGGAGCCGTTCAGGATGATGTTCGATTCCACGAGCAGGGCCTCGACTCCGCG 


120 


Qy 


121 


AGGGCGCTGCTCCTGGTTGAAAAGCTGTCGAGCTGGACGGGGCGCGCTCTCCCAGCCACG 


180 


Db 


121 


1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 II 1 1 1 I 1 I I | | | | | II 1 1 1 1 I M 1 1 II 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 
AGGGCGCTGCTCCTGGTTGAAAAGCTGTCGAGCTGGACGGGGCGCGCTCTCCCAGCCACG 


180 


Qy 


181 


CTGGTATGGCAGCACAGCACCCTGAACGCGCTTGCTAGGCATCTGGCGGATACAAGCGGG 


240 



